Concept drift detection in event logs using statistical information of variants

نویسندگان

چکیده مقاله:

In recent years, business process management (BPM) has been highly regarded as an improvement in the efficiency and effectiveness of organizations. Extracting and analyzing information on business processes is an important part of this structure. But these processes are not sustainable over time and may change for a variety of reasons, such as the environment and human resources. These changes in processes are referred to as concept drift. The discovery of concept drifts is one of the challenges in business process management. These drifts may occur suddenly, gradually, periodically or incrementally. This paper proposes an algorithm for identifying concept drifts in event log, based on the distribution of trace variants in the execution of processes. In this method, by moving two windows on the event log, two feature vectors are derived from the two windows trace variants. Then variants of the two windows are compared by applying statistical tests and finally the drifts are identified. Experiments on artificial databases show the correctness of the method and its superiority to the previous methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Concept drift detection in business process logs using deep learning

Process mining provides a bridge between process modeling and analysis on the one hand and data mining on the other hand. Process mining aims at discovering, monitoring, and improving real processes by extracting knowledge from event logs. However, as most business processes change over time (e.g. the effects of new legislation, seasonal effects and etc.), traditional process mining techniques ...

متن کامل

Finding Process Variants in Event Logs

The analysis of event data is particularly challenging when there is a lot of variability. Often, there are different variants of the same process, thus cluttering the overall representations of these processes, such as process models. Therefore, it is important to automatically detect process variants and support the analysis of individual variants and their comparison. Existing approaches can...

متن کامل

Comparing Business Process Variants Using Models and Event Logs

Organizations realize that benefits can be achieved by closely working together on the design of their business processes. But even when there is a joint design for a particular business process, the way individual organizations carry out that process may differ – either wittingly or unwittingly. This paper proposes an analytical approach that helps to compare how different organizations execut...

متن کامل

Concept Drift Detection Using Online Bayesian Classifier

In data classification the goal is to predict the category of novel instances based on a collection of exemplars whose respective categories are known a priori. The state-of-theart includes various algorithms to solve this problem, including Naive Bayes, Random Forest, Support Vector Machines (SVM), among others. Most of these classifiers consider that the statistical data distribution remains ...

متن کامل

Adaptive Concept Drift Detection

Concept drift is an important problem in the context of machine learning and data mining. It can be described as a change in the fundamental concepts underlying the data, or, in its most basic form, as a significant change in the distribution of the data. From a learning theoretic point of view, one can say that concept drift is a violation of the i.i.d. assumption, which states that each examp...

متن کامل

Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 19  شماره 1

صفحات  0- 0

تاریخ انتشار 2022-05

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

کلمات کلیدی برای این مقاله ارائه نشده است

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023